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ABSTRACT 


Laboratory procedures, mathematical theory and distri- 
bution assumptions associated with two microbiological 
testing techniques are presented. A computer simulation 
model is then formulated and programmed based on these 
procedures, and thus the influences of changes in the number 
of microorganisms per sample, distribution of microorganisms 
within the sample, number of positive gs ES. probabilitve ror 
"false positives", distribution of "false positives" and 
technician analysis times are determined. 

Using the basic simulation model as an experimental 
device, an example is presented to demonstrate its use in 
estimating the total time required to analyze a sample using 
each of the two procedures. Five variations of the basic 
model are presented to demonstrate the model's flexibility 
and sensitivity to fixing individual parameters. 

Hypothesis testing is conducted on data obtained with 
the basic model and TUE variations. A significant Z value 
was obtained with variation two in which the probability of 
a false positive was set at zero. Results of all hypothesis 
testing are presented and a discussion of model data appli- 


cation in cost analysis is appended. 
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i INTRODUC TILON 


Laboratory microbiological analysis of animal origin 
food products for the determination oí actual or potential 
health hazards is, at best, a cumbersome, time consuming 
and expensive procedure for which no perfect alternative 1s 
likely to be found in the near future. 

Further, because it is impractical, if not impossible, 
to examine samples for all potentially pathogenic micro- 
organisms, laboratory methods currently in use rely heavily 
upon the isolation and identification of members of 
indicator” groups. 

Briefly, the rationale for using "indicator" groups is 
post they are readily and reliably cultured in the 
lx»oratory and are fairly good pxedictors of general micro- 
biological quality. (1) 

Among the most widely used "indicator" а Ше аг 
which comprises the coliform organisms. These organisms 
are primarily members of the family Enterobacteriaceae, and 
the two genera Escherichia and Aerobacter supply the 
majority of the strains. The American Public Health Asso- 
ciation defines the group as "---all aerobic and facultative 
anaerobic, gram-negative, non-sporeforming rods capable of 
fermenting lactose with the production of acid and gas at 
32 degrees to 35 degrees centigrade within 48 hours incuba- 


р он сола con liquid media. Included in (his Droas 





grouping are some strains of the genera Klebsiella, 
Paracolobactrum, Erwinia and Serratia, as well as the 
Becherichia and Aerobacter. 

Food specifications require that products meet standards 
based in some instances on total coliform counts. Other 
Specifications stipulate limits for the genus Escherichia 
while still others have become more stringent and now 
mMeécuire that food producis contain no members of those E. 
Coli varieties most commonly associated with the intestinal 
tracts of man and other vertibrates. 

Laboratories responsible for analyzing products.under 
these specifications are required to perform one or more of 
the standard coliform procedures designed to enumerate the 
total coliform population of the product under examination. 
(One of these standard procedures will be discussed at 
length in the next section of this paper.) In addition, 
Mora tories must perform specific identification procedures 
on E. Coli varieties to determine whether they are of the 
ШЕС Ток which a zero tolerance has been established. 

While the total SOIT TOS procedures are fairly well 
ELOmndardized and must be adhered to rigorously by all 
laboratories, there are optional techniques available for 
performing the E. Coli typing. Laboratories operating 
under personnel, time and budgetary constraints would 
therefore derive substantial benefit from selecting those 
analytical techniques which were most efficient in terms of 
mesource utilization and, at the same time, provide an 
ecceptable degree of reliability. 
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In general, because of the large number of variables 
involved in these laboratory techniques, a Stralgqhtfervard 
Emalvtic solution to the question of which procedure 15 
most efficient dm a particular laboratory rs not available 
to the laboratory supervisor. Further, because of the time, 
expense and laboratory facilities required to perform these 
procedures, many laboratories can't conduct the additional 
testing necessary to arrive at a satisfactory solution to 


the question on an experimental basis. 


ТА sO LC TI VES 


The primary objective of this paper is to develop and 
demonstrate the use of an analytic proceduxre for evaluatino 
the relative efficiency of two microbiological laboratory 
methods. Specifically, the microbiological methods to be 
considered are coliform serotyping techniques assiciated 
with "Most Probable Number (MPN)" coliform determinations. 

The basic analytic tool to be employed in this analysis 
1S a computer simulation model. А simulation model was 
Busen because, as Naylor (2) states, simulation techniques 
allow us to conduct situational experiments that would 
ordinarily be too expensive and/or too cumbersome to perfom 
Beysically. Clearly, the laboratory procedures to be 
Medeled fit both categories. 

Secondary objectives associated with the procedures to 
be modeled and the computer simulation to be demonstrated 


ате; 





1. To present MPN theory and to describe related 
laboraiory procedures in sufficient detail for development 
of the model. 

2, To discuss the specific system to be modeled. 

3. To describe the model and variations of the model. 

4. To conduct hypothesis testing on total analysis 
time data obtained with the model and to discuss conclusions 
drawn from these results. 

Finally, Appendix 9 of this paper will consider the 
general subject of cost analysis as it relates to laboratory 
procedures of this type and, in particular, will discuss the 
application of data obtained with the basic model to the 


question of dollar cost efficiency. 


ШІЛ MENCASSUMITIONSOSAXNDOEDIEORY 


The standard "Most Probable Number™ (MPN) Coliform 
procedure forms the basis for the techniques to be modeled 
and analyzed. Therefore, a clear understanding of the 
assumptions and theory of MPN determinations is essential 
to e interpretation and application of the model to be 


presented. 


Be FASSUMPTIONS 

There are two principal assumptions. In statistical 
language, the first is that the organisms are distributed 
randomly (uniformly) throughout the sample. This means that 
an organism is equally Сету te be found in any par tee 


the sample, and that there is no tendency for pairs or 





groups of organisms either to cluster together orto repon 
one another. In practice this implies that the sample is 
thoroughly mixed, and if the volume is not too great some 
mechanical device is employed for this purpose. This will 
be discussed further in the "laboratory procedures" section 
of this paper. 

The second assumption is that each subsample from the 
sample, when incubated in the proper culture medium, is 
certain to exhibit growth whenever the subsample contains 
one or more organisms. This will be end further ta 
the "model assumptions" section under “false positives". 
SO, if the culture medium is poor, or if there are factors 
which inhibit growth, or if the presence of more than one 
organism is necessary to initiate growth, the MPN gives an 


underestimate of the true sample density. 


Bee LOE ORY 

Mathematically, MPN theory relates the probability that 
there will be no growth in a subsample to the density of 
organisms in the original sample. Suppose that the sample 
contains V ml., the subsample contains v ml., and that there 
are actually b organisms in the sample. By the second 
assumption, there will be.no growth if and only if the 
sample contains no organisms. (Disregard the possibility of 
false positives for the moment.) Then, calculate the 
probability that none of these b organisms is in the 


subsample. 





Consider a single organism. By the first assumption, 
the probability that it lies in the sample is simply the 
ratio of the volume of the subsample to that of the original 
Elo. i76. V/V. Theprobability totii is noit mni re 
Dbsanplegss Тһсегтетоде (1: - v/V De Since dhere ispa s sus 
to be no kind of attraction or repulsion between organisms, 
these two probabilities hold for any organism, irrespective 
P le positions of the other organisms. (Strictly, this 
requires the additional assumption that the space occupied 
by an organism is negligible relative to v.) Consequently, 
by the multiplication theorem in probability, the probabil- 
ity that none of the b organisms is in the sample is 

p = (1-v/v)> 
Mieemey/V 1s small, this is closely approsamated by 


р = с-УБИУ 


where e is the base of natural logarithms. Finally, since 


b/V is the density S of organisms per ml., we have 


р = oa 


where p is the probability that the subsample is sterile. 
Consider the case of a single dilution. If n subsamples, 
each of volume v, are taken, and if s of these are found to 
Memsterile, the proportion s/n of sterile samples is an 
estimate of p. Hence we obtain an estimate d of the density 


S by the equation 


This gives 





where ln and log stand for logarithms to base e and to base 
ten respectively. | 

The estimate d is the most probabi number No 
organisms per ml. of the ST cM sample. 

this case, the concept of MPN xs secoeweelysmeeced ВЕ 
becomes useful, however, in the more complex situations where 
several dilutions are used. 

If p is the probability that a sample is sterile, the 
probability that s out of n samples are sterile is given by 


Mis binomial distribution as 


t - 
ae A 


-VS 


Since p = e , this expression may be written as 


| me cus vs (1-e7V5)n-s 


en)! 

If we have obtained s sterile samples out of n, this 
formula enables us to plot the probability of this event 
against the true density S. Such curves always have a 
single maximum. 

A curve of this type suggests a method for estimating 5, 
for if we are considering two E of S, it seems 
reasonable to prefer the one which gives a higher probability 
to the result that was actually observed. This argument, 
ESrried to its conclusion, leads to a choice of S for which 
the probability of obtaining the observed result is greatest. 


ШЕ 15 this value of S that is called the "most probable 


number" of organisms in the original sample. 
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In practice, more than one dilution is usually needed. 
The reason is that the precision of the mpn is very poor 
when the volume v in the subsample is such that the sub- 
samples are likely to be all fertile or all sterile. When 
all are fertile, the maximum on the probability curve occurs 
when S is infinite, so that the estimated density is infin- 
ite. When all are sterile the estimated density is zero, 
as may be verified from the equations above. Thus a single 
emraution is successful only if v happens’ to be chosen so 
that some samples are sterile and some are fertile. Such a 
ШОО Се оГ v Gan be made only if the density S is known 
ШЕГІУ closely in advance. As a practical matter; S is not 
known in advance. In default of this knowledge, the practice 
is to use several dilutions in the hope that at least one of 
them will give some sterile and some fertile subsamples. 

To illustrate the general problem, consider the case of 
three dilutions. Let the suffix i indicate the dilution. 
ар 


Bor the 1i dilution the volume of subsample is v., and s; 


1 1 
E of n: samples are found to be sterile. How do we 
ВЕЕ пате S from these results? 

From NOS we can obtain a separate estimate for each 
emu tion EE 2.303 = (Es 
Vi n; 

However, the best way to combine the three estimates а. 
to a single value is not obvious. “Since, as we have seen, 


some dilutions give very poor estimates, it is not satis- 


factory to take the arithmetic mean. 
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One solution is provided by the MPN concept which 
E ends easily to tbus situation. Following thema rr roach 
used in the previous section, first write down the probabil- 


ity of obtaining the observed results for any hypothetical 


value of the true density S. The observed results are that 
51 samples out of n, are sterile at The first diluiion, $2 
out of n, at the second and S3 out ol n. at the third., The 


probability that these three events should all happen is the 
product of three terms. As before, the graph of this 
probability against S shows a single maximum. The value of 
S at this maximum is taken as the MPN. 

The value of the MPN cannot be written down explicitly. 


The equation it satisfies is as follows: (3) 


-v-d - vd -Vad 
P z: NEN MN 1 Ше- 521126 2 Oa УВЕ E 
En ?2V2'953V3 


ee To l-e^V398 1-е - 730 


In laboratories where the numbers of subsamples ш апа 
the dilution ratios are standardized, it is convenient to 
bave a table which gives the MPN for all sets of results 
But are likely to occur. (4) 

In the procedure to be modeled, we will only consider 
the case of three dilutions and five subsamples per dilution. 

Although the number of dilutions and replications within 
dilutions is standardized by laboratory operating procedures 
for most specification testing, an understanding of the 
Mi onale for selecting dilution and replication numbers is 


useful in those instances when a sample is expected to 


Еа папи ПА bevel or contamination. 
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Gencra TIYA in preparation for an estimation by the MPN 
procedure, three decisions must be made as follows: 

1. What range of sample volume is to be examined. 

2. What dilution factor is to be used, 

3. How many subsamples (replications) should be taken 
per dilution. 
These decisions must in some way be related to a prior 
knowledge of the limits within which the true level of 
microbiological contamination is likely to lie and on the 
precision required in the estimate obtained by this proce- 
dure. Specifically, it follows from the previous discussion 
that the best estimate will be obtained from volumes of 
sample in which it is unlikely that all replicates will be 
mtille or that all replicates will be sterile. Then, in a 
series of dilutions, the expected number of contaminants in 
the highest sample volume selected for testing should be at 
least one. Otherwise, there is a risk that all samples will 
be sterile. Similarly, the expected number of contaminants 
in the lowest а Е volume should not exceed two in order 
to avoid an unreasonable risk that catas will be 
fertile. ne this line of thought, the dilution series 
mui be able to estimate any density of contamination that 
lies between l/Highest Volume and 2/Lowest Volume. 

fhis rule is satisfactory il a Sizcable number of 
replications (twenty or more) are being taken at each dilu- 
tion. With small sample replicate numbers (five or less) 


which are required in the procedure we are discussing due to 





time and expense of large replicate numbers, the above 
generalization is too lenient in that it allows too great a 
risk that all replicates will be fertile. Suppose, as in 
our example, that we have three ten fold dilutions with 
wple volumes 1/100, 1/10 апа 1/1. Ву the generalizatıon 
above, we should be able to estimate densities between 1 апа 
200 microorganisms per ml. If, on the other hand, the true 
density of microorganisms in the sample happens to be 200 
per ml., so that the expected number of microorganisms per 
replication in the lowest sample I two, then the 
probability of a sterile sample at this dilution ise? Ол; 
0.135. The probability of a fertile sample is then 

imie- probability of a sterile sample) or (1 = 0.135 = 0.865). 
item, if five replicates are used per dilution as in our 
Ec. the probability that all are fertile is 0.8657, Ons 
ШЕШСЕ, Clearly, at the two higher concentrations all 
samples are very likely to be fertile. Thus we have at best 
a fifty-fifty chance that all samples (replicates) will be 
fertile which necessitates rerunning the sample at other 
pons to obtain cro дыш) estimate. On the other 
hand, if laboratory procedures permit and the expense is not 
too great, it might be well to consider larger numbers of 
replicates. For example, if twenty replicates were used, 
BEC probability that all are fertile becomes m | or 
only about 0.05. 


Bbexloocson to be learned from this ts that it іс 5 


to reduce the upper density when the number of replicates 
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per dilution must be small. In practice, the upper density 
MN reduced from 2/volsto"1/vol. WMhic ie n Cho r 
guessing or estimating from existing laboratory records, the 
two limits between which we can be reasonably certain that 
the true microbiological density lies. The sample volumes 
are then chosen so that the volume of the highest density 

is greater than or equal to Мата Т OL true 
density. Similarly, the volume of the lowest density is 

Ышы сп Во Бе less than or equal to l/highest estimate of 

the density. For example, if we are confident that the 
density is somewhere between a low of 10 and a high of 750 
Pr aml., the highest sample volume should be at least 1/10 
ml.. Similarly, the lowest sample volume should not be more 
ШЕН 1/750 п1.. In this example, as in our Case, three ten 
Bod dilutions 1/10, 1/100, 1/1000 would amply cover this 
range of densities. This range of densities is standardized 
Or most applications in microbiological laboratory testing 
and there is no real advantage to considering a different 

Ба соп гатіо. а E an ne total 
number of samples (replications) in the whole series is kept 
fixed, the average precision is practically the same for any 
dilution ratio between two and ten." 

Thus, in routine testing, the recommended procedure of 
ШОО Three ten fold dilutions and five replicates per dimi 
tion has proven to be the most useful combination and for 
that reason, results are tabulated (see Table 1). An exam- 
ple of the use of this table will be presented in the next 
section, 


ща 





IV. LABORATORY PROCEDURES 


Consider a sample submitted for МОМ Соз боги апо see: 
typing. This sample would be processed as follows: 

1. The sample would be thoroughly mixed with a measured 
volume of diluent in an attempt to achieve the uniformity 
of organism distribution assumed by the MPN procedure. 

2. Five subsamples are selected and diluted as shown 


in the following schematic: 


Prepared Sample (From Step One) 


Subsamples (1:1) ЕП ПІР 


ВЕ Dini p (1:10) mee 


Second Talu tirion (ieee?) 





Що 





3. Subsamples and dilutions are innoculated into 
appropriate growth media. 

4. Innoculated subsamples and dilutions are incubated 
шар twenty-four hours. 

5. At the end of 24 hours, subsamples any of whose 
dilutions are positive are transferred .to confirmatory media 
emd/or are examined individually for E. coli type. 

6. Those confirmatory subsamples which were transferred 
are examined at the end of an additional 24 hours incubation 
5.5 + „2 degrees С. If positive at this point, they are 
meafirmatory for E. coli. 

7. Individual subsamples may now be examined for E. 
coli type. Negative subsamples are observed again at the 
end of 48 hours and if negative then they are discarded. 

Results from this laboratory procedure are normally 
recorded in matrix form as follows: (Rows are dilutions and 
columns are replicates.) 


Tube Number 
Sample Number Dilution ANT 2 3 n 


1 Р due + + =- - + 
1:10 Ж к - - - 
12.1009 +f — — - - 


Each plus in the matrix represents a tube in which 
growth is observed and each minus represents a tube in which 
no growth is observed. If these results are from confirma- 
tory tubes, the MPN per 100 milliliters may be obtained from 


Me MEN table (see Table I). 
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Tabular values are related to the MPN values per gram 
of the sample as follows: 

Consider a sample in which one gram of solid matter is 
suspended in ten milliliters of liquid. In step one above, 
suppose that the sample is diluted ten fold (that is, sample 
is mixed with dilutent on a one in ten basis). Then, 
following step one our testing dilution contains one gram 
per hundred milliliters liquid volume. In this example, the 
MEN per gram can be read directly from the table. Our 
sample matrix shows three positive tubes in the 1:1 dilution, 
two positive tubes in the 1:10 dilution and one positive tube 
in the 1:100 dilution. Then, reading from the table under 
the 3-2-1 values gives an MPN per 100 ml. 02.17. 

Clearly, if the original dilution represents something 
other than one gram in 100 ml. of liquid, tabular results 
must be adjusted. This is easily accomplished by the 


following formula: 


dilution factor 
х of middle = MPN per gram 
PUE еее 


NPN from table 
100 
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Vo THE SYSTEM TO PF03000 0 [ED 


The system to be modeled is that part of the analysis 
which requires that the positive subsamples (replicates) be 
examined individually for E. coli type. As discussed in 
the laboratory procedures section, this typing may be 


accomplished in two basic ways. 


EN PROCEDURE А 

At step seven in the laboratory procedure the technician 
selects those sample fermentation tubes which show gas 
(carbon dioxide) production. Each positive tube is then 
further examined for E. coli type by a macroagglutination 
procedure in which the E. coli contaminant acts as the 
antigenic agent and illicits an agglutination of the type 
Specific antisera in one of the ten typing tubes to be 
planted, ТЕ the contaminant is not E. coli, no specific 
agglutination will be illicited from the antisera in the ten 
typing tubes and it may be concluded that the contaminant 
was not E. coli or, more generally, that the fermentation 
tube had shown gas production due to any one or more of a 
Uude variety of nonspecific causes all of which will be 
treated under the general classification "false positive'. 
It will be noted that a false positive required exactly as 
much technician time to examine as did the tubes in which 
E. coli was present. In terms of resource utilization, this 


procedure can result in fewer total serotype tubes implanted 
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and examined and if the number of positive confirmatory tubes 
is small there may be a significant saving of пес анай time. 
ГО PROCEDURE B 

At step five in tbe laboratory procedure the technician 


can implant ten subgroup (serotype) tubes at the same time 
the confirmatory E. coli tubes are being implanted. This 


routine offers the advantage of saving technician time 
during the implanting procedure but clearly requires that 
the technician implant a large number of tubes for each sam- 
ple (50 tubes per sample). Samples for analysis will be 
generated by the model on the basis of distribution assump- 
Bons in the MPN procedure, Individual technician times, 
numbers of contaminants per sample, and the occurrance of 
false positives are arbitrarily established for demonstration 
purposes only. А11 parameters in this system except those 
related to the basic MPN assumptions could be easily and 
eekly determined in the laboratory prior to application of 
the model for a specific laboratory — 

In order to make this model as general as possible, 


positive tubes within.a dilution are referred to as anti- 


genic groups. Similarly, positive serotypes within a group 


Ee referred to as antigenic subgroups. Further, rather than 


restrict the nomenclature in the model to coliform groups, 
all organisms in a ee referred to as microbiological 
contaminants. Hopefully, these generalities will encourage 
readers to examine the possibility of applying the model 


Шота variety of laboratory procedures. 
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Ут. DESCRIPTION OF Ste GbE 


BEL LOW CHART 


A flow chart of the program is attached as appendix 1. 


ГО EXPLANATION OF PROGRAM LISTING 


Ш ИАТКІХ - Represent- the sample to be analyzed. The five 


N 
IX,KX,MX 
LA 

LB 

NAT 

NBT 
LAS,LBS 


UMLAS 


UMLBS 


rows of the matrix represent the five replicates 
(subsamples) which are referred to as Antigenic 
Groups rand the ten columns cpl Caen сеже = 
tubes referred to as Antigenic Subgroups. 

Counter used in the program to keep track of the 
number of samples analyzed. 

Counter to determine the number of microbiological 
contaminants entered in the sample matrix. 

Number of samples to be analyzed. 

Seed values for the random number generator. 
Calculated time required for a technician to 
analyze one sample using procedure A. 

Calculated time required for a technician to 
analyze one sample using procedure B. 

Random time required for analysis of one replicate 
(group) using procedure A. 

Random time required for analysis of one group 
using procedure B. 

Square of LA апа LB. 

Sum of squares of LA. 


Sum ol secuares"of LB. 


a 





Dum 


NG 


RX 
IROW 
JCOL 
TEMEA 
ШИЕ Б 
TTIMEA 
Ом В 
ERIMEA 
EFTMEB 
CTIMEA 
СЕМЕ В 
DTIMEA 
DTIMEB 


QTIMEA 


QTIMEB 


ЖТА Т 


Number of microbiological contaminants in a sample. 
Number of positive replicates (groups) in the 
confirmatory MPN tubes. 

A uniformly distributed random variable from O to l. 
A random group to be included in the sample. 

A random subgroup to be included in the sample. 

Sum of analysis times for procedure A. 

Sum of analysis limes for procedure В. 

Mean of analysis times for procedure A. 

Mean of analysis iimes for procedure B. 

Variance of analysis times for procedure A, 
Variance of analysis times for procedure B. 

95% lower confidence limit of mean for procedure A. 
95% lower confidence limit of mean for procedure B. 
95% upper confidence limit of mean for procedure A, 
95% upper confidence limit of mean for procedure B. 
Standard deviation of analysis times Гот 

procedure А r N 

Siandard deviaiion of analysis times for 

procedure B / JN. 

Calculated Z value for testing the null hypothesis 
of no difference between mean analysis times for 


the two procedures. 
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VIT. ПО О ЕКІСІ ИП Те, 


A matrix of sample contaminants is generated and printed 
Se as follows: 


Antigenic Subgroup 


ще, 2» 2 5 Ден NEU 

Antigenic Group П О О О О | О О О ji О 
бос 1 саласы с ас CO ССр 

3 О О О О О О О О О О 

4 О О N 1 O O O O O O 

5 П О О О О О 1 О О О 


Where the 1's indicate that a contaminant is present and 
the O's indicate that no contaminant is present. As stated 
earlier, the antigenic groups 1 thru 5 correspond to the five 
subsamples (replications) prepared for the MPN procedure and 
the antigenic subgroups correspond to the ten possible 
(hypothetical) serotypes of the microbiological contaminant. 
Random variables for these entries are generated by the 
simulation model based on the assumption of normality in 
organism distribution from the MEN theory. 

The computer first generates a random variable for 
matrix row (group) and then generates a random variable for 
matrix column (subgroup). These two numbers identify the 
specific tube in which a microbiological contaminant will 
be entered. The computer then scans the matrix (sample) and 


Авеста 11а the proper row and column. If a 1l has 


о 





previously been entered in that matrix row and column, the 
computer generates a new random variable for.matrix row and 
a new random variable for matrix column and repeats the 
above process until the matrix (sample) contains the 
specified number of microbiological contaminants. 

The computer then counts and records the numbers of 
mecitive groups (including false positives) in cach gener- 
ated sample, prints it out, computes technician times for 
the sample by each of the two procedures and calculates 
statistics on means, variances, confidence intervals and 


Z values for means according to the following scheme: 


X = Sample Mean 
М = Population їз 
ЕС = Sample Variance 
= Sample Standard Deviation 
gê = Population Variance 


Об = Population Standard Deviation 


Theory - For large N (by the central limit theorem) 


JN. ( м) = м(0,1) 


X = 
г 
then, P(-1.96 + МА RE = 1.96) = .95 


2 2 Е 
and, using s as an estimate forr? this becomes 








Р(Х МОС OG 


ИМ. IN 


y = 05 


for the 95% confidence interval about the sample mean (X). 
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The model computes the values by keeping a running sum 
of total times for each procedure (TIMEA and TIMEB), a 
running sum of squares of total times (UMLAS and UMLBS) and 
number of samples processed (N). After completing all 
sample processing, the model computes sample means (X) by 
dividing TIMEA and TIMEB by N. 


Sample variances are computed by the equation 


2 
> (EX) 
2 2 Xi - N 
S = 
N-1 


For computational convenience and because of large N in the 


exercise, this is computed in the model by 


беу 





J 
= N N N 
2 
aes 
N N 


then, from the values calculated by the model for the above: 


BTIMEA = “МАЗ - (TTIMEA)” 

and, similarly 
UMLBS 2 
BTIMEB = — = - (ТТІМЕВ) 
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then, factors for 95% confidence limits are computed 


QTIMEA = VBEIMEA 
им 
Sam larly, 
Vau MED 
OTIMEB -» ————— 


YN 


The hypothesis testing for differences between means is 
conducted as follows: 

x, and X, are the sample means obtained from large 
sample of size N drawn from populations having means u] 
and =. and standard deviations Vi and 2 Then we can 


test the hypothesis of no difference between means (мат мо) 


пиша the statistic 


Z = — Е 
DS 
(X,-X,) 
where 
2 2 
c s BEES 
(X,-X,) = 1 2 


Here, the Z statistic is used rather than the t statistic 
because of the large sample size (400). In the model, the 
Z statistic is computed as 


Е: ТГВ 


> = 
пай ВТ: МЕА + ЕТЕМЕВ 
FA N 


Then, referring to the Normal probability tables, for a two 


part test" and .05 ете of significance: 
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l. If the Caleuleteds7 value iSigrea lemon а Е 
less than -1.96, reject the hypothesis. 

2. lf the calculated Z value is less than 1.96 and 
greater than -1.96, accept the hypothesis. 

See Table 2 for a summary of results obtained with the 
basic model and five variations in which one or more of the 
variables is fixed (held constant). These variations will 
be described in the next section and will be discussed 


individually in Appendices 4 - 8. 


МЕТ. VARIATIONS OF TAE MODEL 


Five variations of the basic model were used in order to 
demonstrate the flexibility of the model and the overall 
enge in results due to fixing individual variables. In 
each variation, the random meses process is unaltered by 
the process of fixing a variable. 

The five variations are as follows: 

l. The number of contaminants (NT in the computer 
program listing) oe fixed. (Appendix 4) 

2. The probability of a false positive was set at 
zero. а 5) 

3. The analysis time for technician on procedure A was 
fixed at seven minutes per positive group. (Appendix 6) 

4. The analysis time for technician on procedure B was 
fixed at seven minutes per positive group. (Appendix 7) 


5. Both technician times were fixed. (Appendix 8) 
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IX. VERIFICATION OF RESULTS 


Verification of results red with the basic model 
and the five variations was accomplished manually as follows: 
I. in order to verify tbe individual sample matrices, 
dam initial run using a sample size of twenty, in which the 

basic model prints out each sample matrix number, the 
complete matrix, the identity and number of groups, false 
positives, analysis times for each sample and procedure 1s 
attached as Appendix 2. The entries in each matrix were 
verified by counting them individually and comparing the 
results with those tabulated by the computer following each 
sample. (See table in Appendix 2) 
2. Confidence limits were verified manually by computing 
the results individually as shown in the following example. 
For the basic model - Procedure A - Appendix 3 


ze 2 
N = 400 224297 S = 80.567 


/80,56 
СЕ е у з лс с 
400 


34.97 + .878 


II 


Upper C.I, 


д 


35.848 


24:97 2:878 


! 


Lower C.I. 


R 


34.092 


Rounding these gives the values in Table 2 and in Appendix 3. 


се 





3. Z values were verified manually as shown in the 
following example. 


For the basic model - Appendix 3 


Ху + X> 


⁄ = 


_ 34.973 - 34.937 
80.568+50.808 
400 


TOS 


2.326 


= .O61 


Computer value from Table 2 (and from Appendix 3) = .06105. 


ЭХ еее 20505 


Results obtained with the basic model ein’ the five 
variations are summarized in Table 2. Conclusions based on 
these results ARE follows: 

Ще For the basic model and all five variations, it must 
be concluded that the true population mean analysis times 
lie between the 95% confidence limits shown in the table 
unless a one in twenty sampling error has been made. 

2. For the basic model and variations 1, 3, 4 and 5, 
the hypothesis of no difference between mean analysis times 


must be accepted. От, stated another way, we must conclude 


that the observed differences between mean analysis times 





for the two simulated procedures is due to chance alone at 
tis level- of significance: 

З. For variation 2, the hypothesis ot ПО fcm me 
between mean analysis times must be rejected. Thus we may 
conclude with 95% confidence that there is a real difference 
between mean analysis times, and, because the Z value is 
negative, that procedure А 15 significantly better than 
procedure B. In fact, referring to the Normal probability 
tables, it can be seen that with a Z value this large, our 
confidence in this conclusion can exceed 99%. Having 
obtained a Z value this large with variation 2, the labora- 
tory supervisor might well pursue the question of false 
positives further by performing a sensitivity analysis on 
the range of probabilities from O to .2 and thereby identify 
the specific level of false positives necessary to produce 
a statistically significant difference between the two 
simulated procedures. That is, find the probability level 
for false positives at which the Z value no longer exceeds 
1.96. (See Appendix 5) 

In summary, it MS be recalled that all parameter 
assignment in the preceeding example was arbitrary and that 
conclusions based on these hypothetical values are not 
intended to imply that Procedure A is, in general, better 


tnan Procedure B. 
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TABLE I (Continued) 


Most Probable Numbers Per 100 ml. of Sample, Planting 
5 Portions in each of 3 Dilutions in Geome ло = сажа 


Positives Positives Роза ра мес 
with with with 
10 ША ОТ MPN 110, Leo MPN lO 12729237 MPN 
ШІН ml. ml. mls mim zem e Te 
3 O O VEG 4 O O 13 5 O O 23 
E O 1 11 4 О 1 ТА 5 О j BI 
3 O 2 13 4 O 2 21 5 О 2. 43 
3 О 3 16 4 O 3 25 5 O 3 58 
B O 4 20 4 O 4 30 5 О 4 76 
E О 5 23 4 O 5 36 5 О 5 95 
E ] O 11 4 | О 17 5 1 О 33 
3 1 [| ша 4 1 1 2p 5 1 1 46 
3 1 2 I7 4 1 2 26 5 1 2 64 
3 1 E 20 4 1 © ° 5 al 3 84 
3 1 4 23 4 1 4 36 5 | 4 110 
3 1 5 27 4 1 5 42 5 1 5 130 
E 2 О 14 4 2. О 22 5 2 O 49 
3 2 1 17 4 2 1 26 Э 2 l 70 
3 2 2 20 4 D 2 32 5 2 2 95 
3 2 E 24 4 2 3 38 5 2 3 120 
3 2 4 d 4 2 4 44 5 2 4 150 
3 2 5 3] 4 2 5 50 5 2 5 180 
5 6 O И 4 3 О 27 5 © О 79 
3 3 J 21 4 3 l ва 5 E 1 110 
E = 2 24 4 3 2 39 5 3 2 140 
3 E 3 28 4 3 3 45 5 © 3 180 
E 3 4 31 4 3 4 52 5 © 4 210 
3 ES 5 35 4 3 5 59 5 3 5 250 
E 4 О 21 4 4 O 34 5 4 O 130 
B 4 1 24 4 4 1 40 5 4 1 170 
3 4 2 28 4 4 2 47 5 4 2 220 
3 4 3 ща 4 4 3 54 5 4 3 280 
3 4 4 36 4 4 4 62 5 4 4 350 
E 4 5 40 4 4 5 69 5 4 5 430 
B Б О 26 4 5 O 41 5 5 О 240 
B 5 | 29 4 5 1 48 5 5 1 350 
3 5 2 32 4 5 2 56 5 5 2 540 
3 5 3 237 4 5 3 64 5 5 ES 020 
3 5 4 41 4 5 4 72 5 Б 4 1600 
3 o 5 45 4 5 Б 8l 
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Model 


Basic 
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Мат. 
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Procedure 
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Ә > ш > 
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Меап 
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555 
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33.00 
34.79 


33.00 


TABLE 


Lower 
34.09 
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31205 
34.24 
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35.00 
34.04 


35.00 


2 


95% 


Summary of Means and Z Values 


Сопгъвбепсе imits 


Upper 
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32.65 
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25,585 
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Statistics | 
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СОЕ TZ 


This appendix is included for the purpose of displaying 


the basic fortran program used in this model and to illus- 


trate the procedure used to manually verify the model. 


Verification of Computational Procedures 


Individual samples shown on pages 


appendix are counted and listed below: 


Sample Number 


1 


2 


10 
ji 
12 
13 
14 
15 
16 
"n 
18 
19 


20 


Positive Groups 


3 


3 


Computer Count 


>33 


Manual 


3 


3 


through 


Count 


Of thas 


Deviation 


O 





Thus, it is readily seen that there is no difference between 
manual counts of positive groups and computer counts. 
Further, Z statistics can be verified manually from results 
shown in Appendices 3-8. 


Consider the data in Appendix 3 for example: 


= 34.97249 - 34.903750 


O. 5070541950 50659 
400 


_ .03499 


_ /.328 


ооо 
Ton 


с ОБ 


Computer Value = .06105 
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APPENDIX 3 


This is the basic model in which none of the variables 
is fixed. Therefore, results with this model should 
Md cate: most accuratelywif there is a sionificant difíer- 
ence between analysis times for the two e 

The calculated Z value of 0.06 requires that the null 
hypothesis of no difference between mean analysis times for 
the two procedures be accepted at the .05 level. Thus, it 
can be concluded that for the ranges of sample contaminants, 
technician times, level of false positives and number of 
positives within samples chosen for this demonstration LU 
we can have 95% confidence in stating that there is no 
difference between the analysis times required for the two 
procedures. Ох, stated another way, we must conclude that 
the observed difference between means is due to chance at 


Mas level of confidence. 
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COMPUTER OUTPUT 


RESULTS FOR PROCEDURE А 


NUMBER OF SAMPLES ANALYZED 400 
MEAN OF ANALYSIS TIME WAS 34.97249 
95% LOWER CONFIDENCE LIMIT 34.09283 
ES MER CONFIDENCE LIMIT 35405213 
VARIANCE OF ANALYSIS TIME 80.56763 


111 FOR PROCEDURE Б 

NUMBER OF SAMPLES ANALYZED 400 
MEAN OF ANALYSIS TIME WAS 34.93750 
95% LOWER CONFIDENCE LIMIT 34.23895 


2S oles kh CONREDENGE LIMLT 35.03003 
WARIANCE OF ANALYSIS ТЕМЕ  50.90859 


ІНЕ СЭӘТАТІЗТІС FOR MEANSEES 0.00105 
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APPENDIX 4 


in this variation of the basic model the number of 
contaminants in each sample to be analyzed is held constant. 
The purpose of this variation is to observe the effect of 
fixing sample contamination on the calculated Z value. In 
terms of laboratory application, this models the procedure 
of performing a large number of analyses on identical 
samples (Өтес сана a ab the same number Of contaminants k 
This result clearly can't be obtained with any degree of 
accuracy in the laboratory and is included to demonstrate 
the power of simulation techniques such as the model 
presented. 

mine Calculated Z vaiue of O.27738 requires that the 
null hypothesis be accepted but clearly gives a larger Z 
value than the basic model which indicates that there is a 
Heme significant difference between mean analysis times 


with this variation than with the basic model. 
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COMPUTE RSOUTPUT 


RESULTS FOR. PROCEDURE А 


NUMBER OF SAMPLES ANALYZED 4.00 
MEAN OF ANALYSIS TIME WAS 35.48749 
95% LOWER CONFIDENCE LIMIT 34.81630 
SIS UPEER CONPIDENCE LIMIT 30215360 
VARIANCE OF ANALYSIS TIME 46.90576 


RESULTS POK PROCEDURE B 

NUMBER OF SAMPLES ANALYZED 400 
MEAN OF ANALYSIS TIME WAS 35.34999 
95% LOWER CONFIDENCE LIMIT 34.64754 


dao WPERER CONFIDENCE LIMIT 36.05225 
КЕПКЕН OF ANALYSIS "TINFZES5I1. 37817 


tee O TTA TISTIC FOR MEANS WS 0227738 
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APPENDISZS 


In this variation, the probability of a "false positive 
was set at zero. The result 1s as might be anticipated, in 
that the number of samples analyzed under procedure A is 
reduced and the analysis time 1s shortened considerably. 

The negative Z value indicates that the times for pro- 
cedure B were greater than the times for procedure A and, 
the hypothesis of no difference between mean analysis times 
is rejected with the calculated Z of -5.71084. Thus, under 
the conditions of this demonstration it can be concluded 
with 95% confidence that there is a difference between means 
and, because the Z value is negative, that procedure A is 
significantly better then procedure B. In fact, referring 
to the Normal probability tables, it can be seen that with 
a Z value this large our confidence can exceed 99%. A 
sensitivity analysis was performed with the following results: 


Prowa ba Vaty sot va 


ЕАО сея vU MG 
EX] = 
E ESO 
la тое 
2:112 zum 9 


Thús, the critical value of probability for false 
positives is slightly less than ,112, that is, as the 
probability of a false positive approaches .111 from above, 
the Z value reaches the point (-1.96) at which the hypothesis 


must be rejected. 





ERMPUTER OUTPUT 


RESULTES FOR PROCEDURE A 


NUMBER OF SAMPLES ANALYZED 400 
MEAN OF ANALYSIS TIME WAS 31.849099 
Bo LOWER "CONFITDENCE LIMIT sires slo 
095% UPRER CONFIDENCE LIMIT 22204070 
VARIANCE OF ANALYSIS TIME  66.10791 


RE SUETS-EORZEROZEDURE В 

NUMBER OF SAMPLES ANALYZED 400 
MEAN OF ANALYSIS TIME WAS 34. 93750 
95% LOWER CONFIDENCE LIMIT 34.238095 


p EUPPEROCONPFTDENCB- LIMIT 35.056092 
VARTANCE OR ANALYSIS TIME 50.80859 


БИН SIATISTIC FOR MEANSEM S. 5270094 
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APPENDIX 6 


In this variation, the analysis time for a technician 
to examine one group under procedure A was fixed at seven 
minutes per positive group. As expected, the variance 
dropped from 80 plus with the basic model to 59.66992 with 
Hus model. This is an indicator of the overall contri- 
bution of variation in technician time (between technicians) 
to the variance of the procedure. No significant difference 


не Z value is observed. 


“ 
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COMPUTERTOUTPFUT 


BEESULTSSGHORCPROCEDURE А 


NUMBER OF SAMPLES ANALYZED 400 
MEAN OF ANALYSIS TIME WAS 34.79250 
95% LOWER CONFIDENCE LIMIT 34.03548 
95% UPPER CONFIDENCE LIMIT 35.54950 
VARIANCE OF ANALYSIS TIME 59.006992 


BE SVUPTSZEOR-ERSKCEDURE ZB 

NUMBER OF SAMPLES ANALYZED 400 
MEAN “OF ANALYSIS TIME WAS 34.93750 
05% LOWER CONFIDENCE LIMIT 34.25895 


0572 ПЕРЕН СОМЕІрЕМСЕ LIMIT 355703608 
VARIANCE OF ANALYSIS TIME  50.80859 


I Z STATISTIC FOR MEANSWIS 0227301 
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APPENDIX 7 


In this variation, the analysis time for a technician 
on procedure B was fixed at seven minutes per group. As 
expected, the variance in results for procedure B dropped 


O zero. This serves as a further check of the validity 


of the program. 
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Ее РОТЕК OUTPUT 


¡RESULTS FOR PROCEDURE А 


NUMBER OF SAMPLES ANALYZED 400 
MEAN OF ANALYSIS TIME WAS 34.097249 
95% LOWER CONFIDENCE LIMIT 34.09283 
95% URPER CONETIDENCETEIMIT 37555213 
VARIANCE OFMANALYSIS TIME 207506763 


ВЕ ВОК PROCEDURE В 

NUMBER OF SAMPLES ANALYZED 400 
LESNO ANALYSIS TIME WAS 35. 00000 
95% LOWER CONFIDENCE LIMIT 35.00000 


Seo UPPER CONPIDENCE. LIMIT 35700000 
VARTANCET OF- ANALYSIS TIME 0.00000 


IRE Z STATISTIC FOR MEANSTIS -0.06130 


Ей 





RETENDI 2 ES 


As a final check on the operation of the computer 
program with parameters fixed, both technician times were 
fixed. The results confirm those obtained in appendices 6 
and 7 for variances of the two procedures. Further, the 
Z value of -0.53725 remains in the acceptance range, further 
demonstrating the effect of technician time between the 
two procedures. These could be considerably more signifi- 
cant in a situation where there were either more technicians 
involved in the procedures or where the variability between 


individual technician times was greater. 
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COMPUTER OUTPUT 


BE SUETSZERORZERNERURFZR 


NUMBER OF SAMPLES ANALYZED 400 
MEAN OF ANALYSIS TIME WAS 34.79250 
95% LOWER CONFIDENCE LIMIT 34.03548 
295% ОБЕ CONFIDENCE LIMIT 35,52950 
VARIANCE OF ANALYSIS TIME 59.606992 


FESULIO PORT PRA EDURETE 
NUMBERSOPESAMPLES ANALYZED 400 
MEAN OF ANALYSIS TIME WAS  35.00000 
95% LOWER CONFIDENCE LIMIT 35.00000 


25% UFL R CONFIDENCE- LIMIT 35.00000 
VARIANCE OF ANALYSIS TIME O.O00000 


Е eels TTC WOK MEANS 18522053725 
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APPENDIX 9 

The objective of this appendix is to present a general 
er eussion of cost analysis as it might be applied lo ihe 
question of choosing between laboratory procedures based on 
imal Cost. Specifically, applications of data obtalmed 
with the simulation model to cost analysis will be discussed, 
Further, because computer facilities may not be readily 
available to the laboratory, mathematical estimation pro- 
cedures which may be employed without the simulation model 
will be presented. 

Costs associated with the laboratory procedures of 
interest will be categorized and discussed individually. A 
model for treating the uncertainty associated with these 
costs will be described. Categorization is an important 
step in preparing a cost analysis and should not be skipped 
over lightly. One sure way io minimize cost in any analysis 
is to overlook or purposely omit some relevant cost. The 
decisionmaker should not permit this to happen without good 
justification. A laboratory supervisor can easily obtain a 
EBaccrse and reliable estimate of some of the costs of a 
laboratory procedure. That data alone, however, is not 
ically helpful in many instances. It is very difficult to 
make a rational choice between proposed laboratory procedures 
A and B, no matter how detailed and precise and dependable 
the cost figures, if the figures en only some 


ШШ КҮГЕ БООЛОП ОГЛЫ total analysis Cost of each 
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procedure. The decisionmaker needs to compare, as well as 
he can, their respective total costs. 

Thus, the real challenge facing the individual preparing 
a cost analysis is to be as comprehensive as possible in the 
analysis. Because there are a few readily identifiable 
costs that can be conveniently identified, measured, and 
evaluated, we focus attention on these and give little, if 
any, attention to those costs that are less easily identified 
measured and evaluated. 

Clearly, there is a difference between dollar expendi- 
tures during a period of time and total cost during that 
same period. If the laboratory supervisor is limiting his 
analysis to that portion of cost associated directly with 
immediate dollar outlay, this cost might well be labeled 
usar expenditure" rather than "total cost”. Most costs 
can, at some point, be translated either into dollar expen- 
ditures or expenditures of resources that can be evaluated 
in terms of dollars. However, there is another category of 
ВЕСЕ that fall into neither of the above dollar categories. 
This includes such intangibles as "convenience", "accepta- 
bility" and the like. Clearly, these must be taken into 
consideration by the laboratory supervisor but for purposes 
of this discussion on cost analysis, these intangibles will 
be ignored. 

Generally, the laboratory supervisor is required to 
perform cost analyses on procedures in operation for bud- 


getary or other administrative purposes. However, cost 
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analysis is also indicated when the cost of equipment and 

jm agents ts sufficiently high towvarrant an investigation of 
the trade-off between total analysis time and total analysis 
Cost. 

Clearly, the procedure that requires significantly less 
analysis time, costs less to perform and provides an 
acceptable level of reliability is the procedure to select: 
On the other hand, when expendable costs associated with a 
procedure is low, it seems reasonable to select those 
procedures which require less analysis time as in the 
example presented with the simulation model. 

Our primary interest is in examining those procedures 
which pose a question regarding the additional cost associ- 
ated with saving analysis time. Ог, stated another way, 
how much additional analysis time will we expend in order to 
save dollar costs. Finally, since our other variable, time, 
also costs money in the laboratory we must aggregate time 
with other cost considerations previously mentioned into 
one workable model and solve the problem: 

Minimize: Cost of Analysis 
Subject (ORe rabili ty Constraint: 

In most laboratory procedures, the question of reliabil- 
ity is dealt with first. More precisely, most laboratory 
supervisors will not be faced with the problem of selecting 
between procedures which do not meet a minimum level of 
ngu Тал и стесресла 1 То) true if the loboratoer Jus 


епаасеа иис опас саса сиса у control work for which most 
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of the laboratory procedures are rather clearly spelled out 
mi contractuai publvcations,. "Бегетоте, тео Бога ое 
supervisor need only examine the question of minimizing 
cost. 

Laboratories wishing to use cost analysis as a decision 
tool will generally fall into one of the following 
categories: 

1. Case 1 - The laboratory has been performing а 
procedure routinely for an extended period of time and has 
decided to consider an alternative (but similar) procedure. 
In this case, the cost analysis will be fairly straight- 
forward because the laboratory can use data on hand from the 
current procedure and either simulate or estimate by direct 
mathematical means the relevant parameters for the new 
puscedure. 

2. Case 2 - The laboratory is interested in selecting 
the most cost efficient of two procedures which have not 
been performed in the laboratory on a routine basis. In 
this case, data relevant to these procedures will not be 
readily available to is analyst and must, therefore, either 
be obtained from an outside source (such as another labora- 
tery) or collected experimentally in the laboratory. 

Ihe value of data obtained from another laboratory may 
be of questionable value unless the analyst has first hand 
knowledge of the circumstances surrounding the collection 
and compilation of the data. Because there is normally a 


great number of areas in which laboratories differ, the use 
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of data obtained from outside laboratories must rank very 
low in the order of preference for data sources. 

A preferable approach, if resources permit, is to 
perform both procedures on an experimental basis in the 
laboratory, collect data and base decisions on that data. 
comet 1s impractical to perform both procedures om an 
experimental basis, as is often the case, then simply 
select one of the procedures on an intuitive basis and use 
it for a reasonable period. When sufficient data is avall- 
able, either model the second procedure using data obtained 
ШЕСІ Тіс first and/or estimate parameters mathematically 
based on data from the first. In any case, it seems reason- 
able that data collected in the laboratory by making direct 
observations of the personnel and laboratory environment in 
question is preferable to using data obtained in another 
laboratory with different personnel working in a different 
environment. 

The point is that results obtained with either a simu- 
lation model or a direct analytic model are no better than 
the data entering the model. Therefore, as much care as 
seems appropriate should be exercised in choosing the data 


base for a cost analysis. 


Data Base 
In order to make this discussion relevant to the type of 
procedures under consideration in the simulation model, all 


cost data will be discussed in terms of the positive group 
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unit. At the same time, the general approach 
in this presentation is equally applicable in 
to laboratory procedures for which the sample 
readily divisible into identifiable groups or 
The first step in preparing a cost analys 
procedures is to categorize the costs associa 
procedures. Keeping in mind the basic requir 
costs be categorized as comprehensively as se 
to the procedures in question, the following 


are established: 


Cost Categories 


Ма able Direct 
Time related IS Technion aN 1. 
2. Facilities 2% 


Eosstive group 
mera ted l fl o or з 1. 


2. Glassware 


3. Ľgüipment Maint. 
and calibration 


Fixed l. Reporting 1 


Terica D 
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to be employed 
most respects 
unit is not 
subgroups. 
is for these 
ted with these 
ement that 
ems appropriate 


Cost categories 


Indirect 


Storage loss 
Samples not 


tested 


Procurement 
and supply 


General Admin. 


Overhead 
(Janitorial; 
utilities; cte) 





Step two in the costing process is to obtain values for 
each cost input and then to^combine the indrvidual anput 
Sects into the appropriate variable and fixed cost cale- 
gories shown in the table above. If the analyst has constant 
or very predictable values for each input in a cost category, 
then the individual input costs need only be added together 
to obtain a category cost value. The term "very predictable" 
in this context is used to describe a value for which the 
Variance is insignificant or has Кооп accurately established 
by some reliable means. 

Generally, the individual costs in each category are 
neither constant nor very predictable and, therefore, it is 
necessary to consider the question of uncertainty associated 
with each input in the cost analysis. 

Although most of the individual inputs in each of the 
categories of variable and fixed costs are self explanatory, 
a brief discussion of the cost estimating aspects of each 
and an approach to the question of treating А се ДО 
follows. 

To the laboratory supervisor who is not firmly grounded 
in probability and statistical theory, the question of 
treating uncertainty in a cost analysis of this type may 
seem overwhelming. The unfortunate result is that a cost 
model which ignores uncertainty is often employed. Clearly, 
what 1s required is a model which permits the laboratory 
Supervisor to improve cost estimates by considering uncer- 
tainty associated with inputs and, at the same time, does 


i 
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not require an unrealistic investment in data collection or 
statistical analysis for each input parameter. 

One model which fits this basic criteria is presented in 
Brand technical publication (6). This model requires That 
the analyst know only the lowest possible, most likely and 
puchest possible (denoted.by L, M and H) values for each 
input parameter to be used in the model. Further, it must 
be assumed that there is a ten percent probability of the 
actual value being lower than L and a ten percent probabil- 
ity of the actual value being higher then H. Then, a simple 


approximation of the expected value or mean becomes 


хг аху Ху 


х = 6 


and, employing the assumptions above, the range Xy mer. 
varies between 2.5 and 2.9 standard deviations for a wide 
class of distributions including rectangular, exponential, 


triangular, normal and beta. Thus we write. 
NM SEE ES 


where @ is the standard deviation. Then, 


Application "ot this model to the cost categories listed 
ШІ, е(ср one is as follows: 
КО TIME RELATED: COSTS 

Obtain values of L, M and H for each of the costs in 
this category and denote each as shown in the individual 


varıablezseetsıons below. 
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1.  TéGbmician - Denote С, мМ ева Has Dio Oin очае 
These values can be obtained from personnel or finance 
offices for each technician and then a weighted average 


calculated for Dim: 


2. Facilities Utilization - Denote L, M and H as by, 


b and b, For most laboratory procedures, the facilities 


2M 


utilization costs include such items:.as laboratory bench 


H' 


space, associated instrumentation, holding facilities, 
incubation facilities and the like. 

STO ta ce” Loss - Denote these as b3¡, Dam and boy. 
@osts in thas atem are those resulting from holding or 
storing quantities of the product while laboratory analysis 
meen progress, That is, the Additional storage costs 
incurred by the delay in obtaining laboratory results. 

Sample Satin tested = Denote these as b AL? b IM and Day” 
INES costs refer to loss and/or deterioration of product 
held for which testing is not accomplished due to utiliza- 
tion of laboratory resources for other testing procedures. 

Now, although we have no real idea of the exact shape 
or characteristics of the time related cost distribution 
which we are attempting to describe, the expected value 
(mean) and standard deviation may be estimated by the 


“ОТ Тоу ng: 
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Then, the mean is 
=> DEED 
Dus coc EE 

) 


emu the standard deviation 1s 


OS T TIVE GROUP RELATED COSTS 

Obtain values of L, M and H for each of the costs in 
this category and denote each in a manner similar to that 
for time related costs. 

1. Reagents - Denote these as C,;, Cıy and Сън: Оп а 
per positive group basis, the variance associated with these 
costs should be reasonably small and, therefore, should not 
be a real problem to estimate. 

2. Glassware - Denote these аз Со, Coy and Copy. This 
cost item is intended to include preparation, handling, 
replacement and loss resulting from the analysis of a 
positive group. In general, it should also include those 
mems Of Cost resulting from’ preparation and handling of all 
appliances and utensils employed in the procedure. 

3. Equipment - Denote these as Car > Cay and Czy. This 
item is intended primarily to include those maintenance and 
calibration costs associated with balances, recorders and 
similar equipment which result directly from the performance 


of the laboratory procedure in question. 


4. Procurement and Supply - Denote these as Ca» САМ 


and Сан" This item is self explanatory but miaht be one of 


the more dilficult to estimate. 
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Now pilet c iL 


HP 
11 
-- 


Ma 


1 


С) 

II 
Ma 

O 


1M 


p 
I 
-- 


O 
L 
И 
Ma 
O 


p 
Ц 
= 


LH 


Then the mean is 
E Ст +4См+©ң 
u 6 


ol 


Ea the standard deviation is 


ШЕ ГТХЕП COSTS 

Unlike the two categories above, fixed costs will be on 
a per sample basis. Further, because the relative variance 
associated with these costs is small compared to the 
variances associated with the two categories above, these 
costs might be treated as constants. 

W Reporting ~ Denote this as ау. 
The process of reporting on most analytic procedures of 
interest in the laboratory consists of entering raw data on 
a standard reporting form and delivering it to the admin- 
istrative office for further processing. Therefore, the 
between sample variance should not be too great. 

2. Clerical - Denote this as = 
Typing tes results from analyses in the laboratory is 


a fairly standard procedure and, clearly, it requires no 
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more effort to type 1. 000 MPN ithan to type 100 MEX T тары 
I should say very little more effort: At any rate, the 
uance should be small for this item.and it probably 
should be treated as a constant. 

3. General Administrative - Denote this as ад: 
This indirect cost is not time related or positive group 
related and can easily be divided equally between samples 
analyzed. Again the variance should be small. 

4. Other Overhead - Denote this as PE 
The procedures under consideration in this model require 
variable amounts of total analysis time and, since overhead 
zs related to time utilized in each procedure, it 
might be reasonable to allocate a fixed portion of overhead 
such as utilities, janitorial services and the like to each 
sample analyzed on the basis of a total fraction of labora- 
tory time required to perform each procedure. For example, 
if the laboratory has five full time technicians and 
operates on a 4O hour week basis, the laboratory then has 
200 analysis hours available. If the procedure in question 
requires a total of 20 analysis hours weekly, then allocate 
one tenth of other overhead costs to this procedure. Divide 
the amount allocated to this procedure by the number of 
samples analyzed and treat this as the cost per sample of 


ether overhead. 


| 4 
Now, let a = > а. 
В 


and treat a as a constant in the analysis. 
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Now, having obtained an estimate for each applicable 
Cost category and, acknowledging that there is considerable 
uncertainty associated with most of these estimates, they 
may be aggregated as follows: 

Nota lost) Group = faxed Cost/Gp. + Variablercos o pa 
mien, Expected Total Cost = a + bx + cy = f | 


Fizcd Cosi 


where a 


| ОС ne o unit (ise. dollars/hour or perros. Guy 


‚у = Variable No. Units (time or Pos. Gps.) 


%1 


Тһеп, Variance of Cost - [E.G 9, 5, 8)o] + ər ]* 

+ Е.(%,%,5,8),12 > (с (%,9,5,2)е,12 
as an approximation where f. means derivative of f with 
respect to the variable x. 

With this estimate of the mean and variance of total 
Cost for each of the two procedures in question, it is 
possible to perform hypothesis testing and determine if 
there is a significant difference between the expected 
Beers for the two procedures. In the calculations above, it 
should be noted that in those instances where the variance 
of She variable is small compared to the variance of a 
variable by which it is being multiplied, then the variable 
with the smaller variance can be treated as a constant and 
the computations thereby greatly simplified. 

As shown in appendices 3-8, both the means and variances 
for ihe variables x and y are readily obtained from the 
simulation model. In the laboratory not having access to a 


simulation model such as this, these values may be estimated 
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(roughly) from either existing laboratory data or from 
experimental work done in the laboratory. In either case, 
the following mathematical approach may be used in estimating 


x and y using only the expected value of input parameters. 


DIRECT ESTIMATE USING MEANS 
1. Actual 
The probability of a microorganism entering a group 
he first trial 18 1/5. Then oh each succeeding trial, 
probability statements must be based on the conditional 
probabilities resulting from the first trial. This proce- 
dure gets very complicated after only a few trials. 
2. Estimate 
Using the same initial probability of a micro- 
Organism entering a group (=) and, applying the binomial 
distribution for an average (mean) number of contaminants 
per sample of three, the probability that a sample contains 


one or more contaminants in one or more groups becomes 


3 
> Probability (Number Positive Groups = i) 
= 


Let p - Probability of Positive Group = 1/5 
а= 1 - р = 4/5 
Then, in three trials (3 contaminants/sample) 
P(O Contaminants in a Group) = ort (.2)°(.8)3=.512 
Thus, P(Contaminant in a Group) = 1 - .512 = „488 
от, about .5 of Groups are positive (*2.5 Gps.). Add this 


(ol (lo оше зо пот от а else posative (2) or, on the 
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average of 1 of 5 groups * 1 group/sample, then average 
number of Positive Groups/Sample = 3.5 = y. 

For Procedure A: 

Setup time = 15 minutes (Average) 

Positive Groups = 3.5 (Including False Positives) = y 
Average Tech. Time = 7 min/group = b 


So ao) 


И 


Total Analysis Time Ea 


ama. for Procedure B: 


I 
ж 


Total Analysis Time = 5 x 7 = 35 


From Model (for comparison) 


Procedure A = 34.97 = X, 
Procedure B = 34.93 = Xp 


Finally, it should be recalled that total.analysis costs 
may change with time and quantity of samples analyzed. Most 
laboratory personnel are familiar with the improved effi- 
ciency that normally results from experience with most 
laboratory procedures. In general, this improved efficiency 
can be thought of as a "learning curve" effect. 

Burther, because the rate at which learning occurs with 
one procedure may be significantly different than the rate 
at which uus occurs with another procedure, it follows 
that costs evaluated on the basis of a few experimental 
sample lots may be significantly different than costs eval- 
uated on comparable sample lots when the learning effect is 
taken into consideration. 

Because the learning curve effect is a significant 


Шас солисти houldsbe included in a cost analysis approach 
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to selecting the most efficient laboratory procedure the 
inal sections of this appendix will contaim а ап зсщшезтош 

Ds һе theory and practice of learning curves. This 
discussion is intended to be comprehensive enough for 
application to the problem at hand. For a more complete 
treatment of the subject, the reader is referred to the Rand 


Sibila cation (6) from which most of this material is taken, 


IMEORY OF LEARNING CURVES 

The basis of learning curve theory is that each time the 
total quantity of items produced (samples analyzed) doubles, 
the cost per item (sample) is reduced to a constant percent- 
age of its previous cost. Alternative forms of the theory 
refer to the incremental (unit) cost of producing an item 
at a given quantity or to the average cost of producing all 
items up to a given quantity. For example, if the cost of 


analyzing the 2o sample is 80 percent of the cost of 


analyzing the ious: sample, and if the cost of the aooth 


sample is 80 percent of the cost of the 20002 


and so tort; 
the process of analyzing samples is said to follow an 8O 
K nt unit learning curve. ТЕ the average cost of 
analyzing all 200 samples is 80 percent of the average cost 
of analyzing the first 100 samples, the process follows an 
80 percent cumulative average learning curve. 

Either formulation of the theory results in a power 
e oa is lincar on logarithmic grids. Ir gures 


shows a unii urve for which the reduction an cost 1s 20 


рексеше ee bD l ing or cumulative sample output. 
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The upper figure shows the curve on arithmetic grids and the 
lower on logarithmic grids. The arithmetic plot shows the 
percentage reduction in cost in each sample analyzed is very 
pronounced for the early units. Оп ап 80 percent curve, for 
example, cost decreases to 28 percent of the original value 
over the first 50 units. Over the next 50 samples analyzed, 
ШЕ Сесіппес only 5 more percentage points, i.e., down to 23 
percent of sample number 1 cost. The factors that account 
for the decline in unit cost as cumulative output increases 
are numerous. Obviously, one major contribution is due to 
ИЕ fami liarization by technicians which results from 
repetition of the analytic procedures. Many of the other 
factors are not clearly understood and no attempt will be 
made to enumerate them here. 
Mmérstoc-Linear Hypothesis 

The relationship between cost and quantity may be 
represented by a power (log-linear) equation of the form 


y = ax? 


~ 


where x equals the cumulative quantity of samples analyzed. 
Ihe constant a is the cost of analyzing the first sample. 
The exponent b, which measures the slope of the learning 
curve bears a simple relationship to the constant percentage 
to which the cost is reduced as the number of samples 
amalyzed is doubled. If S represents the fraction to which 


Cost decreases when quantity doubles, the equation becomes 


b 
on 72% EE О = nb e вото 
De Ax Toda 


ий 





This equation shows that for a value of S equal to 75 998m 
cent, the corresponding value of b is 


p or -.415 


mrotting a Curve 


In the graphical display of learning curves, the problem 
1s to represent the average cost for a lot since, typically, 
analysis times or costs are not recorded by sample unit. 


See, for example, the following table: 


Analysis time per 
Lot Sample Units Tot in. minutes 


1 1-10 583 
2 11-20 437 
3 21-50 1,055 
4 51-100 1,475 


To plot a cumulative average curve from these data, the 


cumulative average hours are computed at the final unit in 


each lot: 
Analysis time Cumulative 
Dior Fount per lot (mimi) Computatron  Averede Minutes 
ЕЛО 583 583/10 58.3 
20 437 1,020/20 51.0 
50 1,055 2,075/50 41.5 
100 eae 3,550/100 25,5 
The cumulative average at the 10!" sample unit is 58.3 


Шігшес ІШІ іс Ше riot plot point.  Successive plot 
points en пе епа ог езеп Шот since these ате theo ami 


where the cumulative average minute figures apply. 
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To plot the unit curve it is first necessary to compute 
the unit minutes and then to establish plot points. The 


unit minutes can be taken as an average for each lot: 


Unit 

Lot Computation Minutes 
1 583/10 58.3 
2 437/10 2c 
3 17033 70 25.2 
4 1.475750 29.5 


The lots can be represented by these unit hour values. 
The question is, where should the values be plotted? To 
plot at the lot arithmetic midpoint is to assume that the 
learning curve can be approximated by a linear curve on 
arithmetic grids, but as suggested by Figure 1 such a method 
of approximation only becomes reasonable for lots following 
a large number of previous samples. Thus, when dealing with 
aro linear function, the arithmetic midpoint plot produces 
the unequal distribution of the area under the curve as 
Баат in Figure 2. 

The true midpoint is defined as that unit, xs which 
represents the entire lot and which must also reflect the 
average unit cost, У of the lot? The total cost of the 
lot is equal to the product of v and the number of samples 
the lot, n. This product will approximate the area 
under the curve for n units (see Figure 3). 

In practice, the mathematics associated with determining 


netusdPepPotopornts makes the procedure difficult.  Phererore 
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n n + n/2 n +n 
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mure 2 - Learning curve on arithmetic grids 


True midpoint 


Figure 3 - True lot midpoint on arithmetic grids 
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when dealing with first few lot quantities which comprise 
more than about 25 samples, plot points can be taken from 
graphs provided in the Rand Publication referenced earlier. 
Or, if graphs are not available, estimate the plot points by 
computing the arithmetic lot midpoint and then moving it 
sumghtly to the left. For succeding lots, the arithmetic 
lot midpoint is usually adequate. Consider the following 
example: - 

If the unit and cumulative average curves are plotted as 
shown on Figure 4, then, to determine the learning rate, 
Simply select two cumulative quantities such that the 
second is two times as large as the first, read their 
respective costs from the graph and determine the ratio of 


the respective costs. 


Curve Cumulative Quantity Cost Learning Rate 
ШО Uni t 10 5 4.1/5 or 82% 
20 4.1 
ru Cumulative 10 © 5.1/6 ог 85% 
Average i 
20 Ey 
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100 


10 





(202.1) 


| 2 100 1000 


Cumulative Quantity 


Figure 4 
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